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1.7 Proof of optimality of the SPRT. For the Neyman-Pearson Lemma, we had a 
statement. Theorem 1.1.3, not involving losses or priors, and another (Theorem 1.1.8) 
when there are losses and a prior. Here, the statement of Theorem 1.5.1 again did not 
involve losses or priors, and did not involve the cost c per observation, but the proof to 
be given for it will do so, much as in Theorems 1.2.5 and 1.2.6 which showed that under 
some conditions Bayes decision rules are admissible. 

For a sequential randomized test {4>n} of P vs. Q as defined in the last section, we can 
represent the randomization as follows. Let A = {—1, 0, 1}. Let (O, JF, ^u) be a probability 
space on which there are i.i.d. random variables yi,y2, ■ ■ ■ , uniformly distributed in [0, 1]. 
Take the j/j to be independent of the Xj. Then given w e fi, let 



If uj is chosen from Q according to then the non-randomized sequential test 0^1^ is applied, 
the probabilities of decisions are just as in the randomized test. Randomized tests will, 
then, be represented by such (^^, where for each u, {(pn}n>o is a sequential test in the 
previously defined (non-randomized) sense. So, sequential randomized tests of P vs. Q are 
all "realizable" in a natural analogy with the notion defined in Sec. 1.3 and treated in Sec. 
1.4, and {(f>ni') '• n > 0, u> & Q} will be called a realized sequential test for P vs. Q. Here 
N will be the random n at which a choice of P or Q is made (or +oo if the choice is never 
made). So N depends on cj as well as {Xj}j>i. Error probabilities such as a{P,(f)) and 
average sample numbers such as Ep^^j^N are naturally defined for sequential randomized 
tests. 

For any given losses, let v := I^pg, the loss if Q is chosen when P is true, and 
w := Lqp, the loss if P is chosen when Q is true. If a prior tt is defined, let p := 7r(P) 
and g := 1—p := tv{Q). 

For a sequential randomized test (p — {4>n}j cost c per observation and p, v, etc. as 
above, the overall risk is defined as the total expected cost plus loss: 



Note that as a function of p, for fixed 0, P, Q, v and w, r(-, 0) is an affine (linear) function 
r{p, 0) = a + bp (since q = 1 —p), i.e. a polynomial of degree 1 in p (or a constant). To see 
this, note that on the right side in the above definition there are two terms times p and 
two terms times g, which don't otherwise depend on p or g. Since g = 1—p, this risk is an 
affine function of p. 

Given P, Q, p, c > 0, v > 0, and > 0, a which minimizes r(p,(f)) and has 
finite risk will be called Bayes, agreeing with previous definitions except that we now have 
additional terms depending on the cost per observation. 

For any test and x G X let 0*^^^ be the test defined by 




-1, 



ifyn<4>n{Xl,... ,Xn){-l) 
iiyn > l-0n(Xi,... ,Xn) (1) 



otherwise. 



rip, 0) := pva{P, 0) + qwa{Q, 0) + pcEp^^N + qcEq^^N. 



0n^''^(yi,--- ,yn) := (j)n+i{x,yi,... ,yn) 
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for all yi, ■ ■ ■ , Vn in X and n= 1,2.... 

An equation (pn = u for a realized test will mean that (f)'^ = u for (almost) all uj, 
where both cpn and u may depend on Xi, . . . , X^. 

It will be shown that for any test taking at least one observation, the risk equals the 
expected posterior risk after one observation. In other words, 

1.7.1 Lemma. If is a realized sequential test with (f)Q = 0, then 



where p^ is the posterior probability of P after observing x. 

Proof. It will be enough to prove this for non-randomized tests, then integrate. Let 
r{P,(t)) :— r{l,(j)) and r{Q^(^) := r(0,(/)). For f{x) := Rq/p{x) we have Px = 
p/{p + Qfi^)) and Qx := 1 — Px — Qf{x)/{p + qf{x)). Note that in this case the 
predictive measure 7 on X is pP + qQ. Recall that by Theorem 1.3.7, the posterior is 
well-defined for 7-almost all x and doesn't depend on the dominating measure. Next, 

r{Px.&^ = p,r(P,(/)(^)) + g,r(g,(^(-)). 

Then, decomposing the integral with respect to x over the two sets {/ < 00} and {/ = 00} 
gives 

/ r(p„0(-))(p + ?/(x))dP(x)+ / r{p,,^^-))qdQ{x) 

= [ pr{P,<i)^'=^)+qf{x)r{Q,<f>^''^)dP{x)+q [ r{pxA^"^)dQ{x) 

Jf<oo J f=oo 

(1.7.2) = pj r{P,(p^'''^)dP{x) + q J r{Q,(p^''^)dQ{x). 

On the other hand, r{p,(p) = pr{P,(p) + qr{Q,(f)). For x G write x — {xijX^^'^) 
where x^^^ := {xj}j>2 and (j){x) = (^o, {0n(2^i5 ■ ■ ■ tX^) }n>i)- Let N := N{x) be the 
least n such that (f)n{xi, ■ ■ ■ ,Xn) 7^ 0, in other words the time a decision is made. Let 
1{. . . } denote the indicator function of the set {•••}• Then 

r(P,0) = j L{P,^{x))dP°°{x) with L(P,0(a;)) := cN{x) + v1{^n = '^}, 

and where P°° is the distribution for which xi, x^-, - ■ ■ are i.i.d. with distribution P . By 
the Tonelli-Fubini theorem, 

r(P,0) = yy L{P,(^{(^xx,x^^^)))dP'^{x^^^)dP{xx). 



For y — x\ fixed, and since 4>Q = 0, the inner integral equals c + r(P, 0^^^). The corre- 
sponding fact for Q then combines to give 

rip A) = {p + q)c + pj r{P,4>^y^)dP{y) + q J r{Q,4>^y^)dQ{y), 

and using (1.7.2) completes the proof of Lemma 1.7.1. □ 

Now, if we multiply the cost c per observation and the losses v and w all by a constant 
h > 0, leaving p fixed, then we multiply r{p, cf)) by h for any test (p. (This can be considered 
a choice of units or scale for costs and losses.) So if (f) is Bayes for c, v and w, it is also for 
he, hv and hw. So in this proof (the proof of Theorem 1.5.1), letting h := l/{v + w), we 
can and will assume v + w = 1. 

Let r{p) := mi{r{p,4>) : (p E C} where C is the class of all realized sequential tests 
(f) with (f)Q = 0, so that N > 1 always. Then for all (p E C, r{p, (p) > c for < p < 1. If V 
is a test with Vq = and Vi = 1, then r{p, V) — c + pv, so c < r{p) < c + pv and r(-) is 
continuous at (see Fig. 1.7A). Likewise, c < r{p) < c + (1 — p)w implies r is continuous 
at 1. If < t < 1 and x, y E [0, 1], then for any test (p, we have by the affine form of 
r(p, (p) as a function of p (with other quantities fixed), as noted just after the definition of 
r{p, (p), that 

(1.7.3) r{tx + (1 - t)y, 4>) = tr{x, 0) + (1 - t)r{y, cP), 

and so 

ritx+{l-t)y) = mf{tr{x,(P) + {l-t)r{y,(P) : (p e C} > tr{x) + (1 - t)r{y). 

Thus r(-) is concave, and so continuous on the open interval (0, 1) (RAP, Theorem 6.3.4; 
— r is convex). So r(-) is continuous on the closed interval [0, 1]. 

Let R{p) :— inf0r(p, (p) (the infimum over all tests, not just those in C). Then R{-) 
can be approximated as follows: 

1.7.4 Lemma. For any £ > there is a function F from [0, 1] to the set of all realized 
sequential tests, where F has finitely many tests as possible values, each on an interval, 
and where for < p < 1, R{p) < r{p, F{p)) < R{p) + e. 

Proof. As with r(-), R{-) is concave. Let S be the test with S'o = 1 and T the test 
with To = — 1. So chooses Q and T chooses P, each with no observations [N = 0). 
Then r{p, S) = pv and r{p,T) = qw. If at least one observation is taken, the risk is r{p). 
The overall risk, as a function of p, is the minimum of these three risks (and could not be 
reduced by choosing at random among S, T and a test in C). So as shown in Fig. 1.7A, 

(1.7.5) R{p) = min(pv, r(p), (1 — p)t(;), 

so R{-) is continuous for < p < 1. Also, for any fixed test (p, r{p, (p) is affine (linear) in p 
by (1.7.3) and so continuous. For each p e [0, 1] there is a test (pp with r(p, (pp) < R{p)+s, 
and so r{u,(pp) < R{u) + e for all u in some relatively open interval Jp containing p in 
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c = cost per observation 
V = LpQ = loss when P is true, Q is chosen 
w = Lqp = loss when Q is true, P is chosen 
(c, V, w) normalized so that v + w = 1 



risk if P is 
chosen without 
any observations 

'1 — p)w = r{p, (j)) 

for 00 = ~1 



pv — risk if Q 
is chosen without 
any observations 
= r(p, 0) for 00 = 1 




-- I'll; 



minimum risk 

if at least 

one observation 

is taken 



P 



Fig. 1 .7.A. Minimum risl< for sequential tests 



[0, 1]. By compactness these intervals have a finite subcover Jp(j), i = 1, . . . ,m. Then we 
can take disjoint subintervals /j C Jp(i) which cover [0,1] (but are no longer open), and 
set F{u) = 4)p{i) for u & li for each i = 1, . . . ,m. □ 

We also see in (1.7.5) that if a Bayes test cj) oi P vs. Q exists, and pv is strictly less 
than either of the other two terms on the right, we must have 4>q = 1. If {l—p)w is strictly 
the smallest of the three terms, then 0o = ~1- And if r{p) is strictly smallest, then (pQ = 0. 

Now if r{w) < vw, then the function g{p) :— r{p) — pv is concave and continuous 
on [0, 1], ^(0) > 0, and g{w) < 0. Thus by the intermediate value theorem, g has a root 
PL with < PL < w (again, see Fig. 1.7A). Since g{0) > and g is concave, the root is 
unique (if < a < 6 < 1 and g{b) = then g{a) > by concavity). Likewise, the function 
h{p) := r{p) — (1 — p)w has a unique root pu with w < pu < Or if r{w) > vw, then 
let PL := Pu '■= w. 

1.7.6 Proposition, li pl < p < pu then 

R{p) - c + J R{p^)d{pP + qQ){x). 

Proof. In the given range, R{p) = r{p). Let be any test with 0o = 0. Then by Lemma 
1.7.1, 

r(p,0) = c + J r{p^,(f)^''^)d{pP + qQ){x) > c + J R{Px)d{pP + qQ){x). 

Taking the infimum over such (j) (cj) e C) gives 

R{p) = r{p) > c + J R{p^)d{pP + qQ){x). 

(Note that since x is measurable and R is continuous, x ^ R{px) is measurable.) 

For the reverse inequality, given £ > and a fixed p, take F from Lemma 1.7.4 and 
define a test -q with ?7o = and ■q^'^^ := F{px) for all x. Then by Lemma 1.7.1 again, 

R{p) < r{p,r]) = c + J r{pj:,ri^''^)d{pP + qQ){x) 

< c + e + J R{pa,)d{pP + qQ){x). 

Letting £ | finishes the proof. □ 

The Neyman- Pearson lemma on admissibility (Theorem 1.1.3) was proved above be- 
fore the characterization of Bayes tests (Theorem 1.1.8). In contrast, for sequential tests, 
the following sufficient condition for the Bayes property will be proved, then used in the 
rest of the proof of Theorem 1.5.1. 
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1.7.7 Theorem. Suppose given P, Q, c, p and v with 0<w := 1 — v < 1. Let 

q A := p(l -p[/)/(gpf/) and 5 := p{l - Pl) / {qPl)- Then ^ := SPRT(^,S) 

is Bayes for P vs. Q. 

Conversely, any Bayes sequential test of P vs. Q must, almost surely for and 
for each n, if no previous decision was made, continue (take at least one more observation) 
when A < Tn < or choose P if r„ < or Q if > B. 

Proof, li p> pu, then ^ > 1 and ij) chooses P immediately, attaining r{p,ip) — R{p) = 
(1 — p)w. Likewise if p < and p < pu, then A < B < 1 and i/j chooses Q immediately, 
attaining r(p, ifj) = R{p) = pv. So is Bayes in these cases. 

If r{w) > vw, then pL = pu = w and p > pu or p < pl for all p. So we can assum 

r{w) < vw, 

We are left with the main case Pl < P < Pu- Let p{x) := Px- Now p^ > pu is 
equivalent to p>pu{p-\- gfix)) or f{x) = ri(x) < A, while < is likewise equivalent 
to ri{x) > B. Thus if < p < piy? but p{Xi) < pL or p{Xi) > pu, then makes a 
choice after one observation, achieving a risk c + R{p{Xi)), matching the corresponding 
part of the integral in Proposition 1.7.6. The case pl < p{Xi) < pu remains. After n 
observations, the posterior probability of P is 

pin) .- pin)(^xi,... ,Xn) := p/ {p + Tnq) , 

where r„ := Hk^xu fi-^j)- Now pl < p^^^ if and only if P > r„. Also, p^^'' < pu if and 
only ii rn> A. 

Moreover, by the remarks after the proof of Lemma 1.7.4, applied after the nth ob- 
servation, if a Bayes sequential test has not made a decision previously, it must choose P 
if r„ < A, choose Q if r„ > P, and continue if ^ < rn < P, proving the last statement in 
Theorem 1.7.7. 

Let Cn be the event that A < rj < B for j = 0, 1, . . . , n, so that ijj takes at least n + 1 
observations. For n > 1 let Vn := Cn-i n {r^ < A} and Wn := Cn-i n {r^ > B}. 

1.7.8 Lemma, li pl < p < pu and n — 1,2, ... , then 

n 

Rip) > Tnip) := J]p[P^(W^,)(^ + jc)]+g[g^(y,0(«^ + ic)]+jcbP^(V,0+gg^(t^i^ 
i=i 

Proof. The proof will be by induction on n. For n = 1, 

Tiip) = c[ipP + qQ)iWi Li Vi)]+vpP{Wi)+wqQiVi) < Rip) 

by Proposition 1.7.6 as follows. The coefficient of c above is < 1. For p^ > pu, which is 
the event Vi, Ripx) = (1 — Px)w. Likewise for p^ < Pl, which is Wi, Ripx) = PxV. 

For the induction step, note that T^ip) is the risk restricted to the complement of C„. 
If x{n) := (xi, . . . , Xn), then on Cn, Pl < T^xin) < '^Rj so Proposition 1.7.6 applies to the 
induction step from n to n + 1 just as in the proof for n = 1. □ 
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On the other hand, 



= pr(P,V) + 9r(Q,V) = pj L{P,i;{x))dP^ix) + q J L{Q,i;{x))dQ°° (x) 



<Tn{p)+p L{P,i;{x))dP"^{x)+q L{Q,i;{x))dQ"^{x). 

As n — ^ oo, the events C„ decrease, P^{Cn) [ and Q°^{Cn) i by Lemma 1.5.2. On 
Cn-i \ Cm L{n,'i/j{x)) < nc + m8ix{v,w) for n = P or Q. By Lemma 1.5.2 and the fact 
that + nc)^^ converges whenever 7 < 1, for any constants a, c, it follows that 

the integrals in the last display are finite and decrease to by dominated convergence or 
monotone convergence. So r(p, t/^) < R(p), and r(p,i(^) = R{p) and '4> is Bayes, finishing 
the proof of Theorem 1.7.7. □ 

Now, the dependence of pl and pu on c and v (with v -\- w = 1) will be indicated in 
the notation, setting PL := pl(c,v), pu '■= puic,v). 

1.7.9 Lemma. The functions pl{-,v) and pu{-,v) are continuous in c for c > and each 
fixed V. 

Proof. For each p and (f), the risk r{p, 0) is a non-decreasing function of c, which will be 
called r{p,(f):c) or r{p, (p; c,v). Thus, r{p;c,v) := r{p;c) :— r{p) is non-decreasing in c 
for each fixed p and v. Given c and 5 > 0, let (f)he a, test such that r{p, (f); c) < r(p; c) + S. 
Let M :— max{Ep^(i)N, Eq^^pN) < 00 since c > and 

M < [r(p;c) + S]/{c-mm(p,q)). 

Take any ci < c < C2. Then 

M < Ml := [r(p;c2) +5]/(ci • min(p,g)). 

Then if ci < c < C2, for any h > such that ci<c<c-|-/i<C2, 

r{p;c + h) < r{p,(t);c + h) < r(j),(f);c) + Mih < r{p;c)+Mih + S. 

Letting d I gives that < r{p;c + h) — r{p;c) < Mih. In other words r{p;c) is, for 
c > 0, locally Lipschitz in c. So, it is continuous in c. Recall that r{p; c) > c for all p. 
Thus there is a unique smallest c := cq := co{v) := Co{v,p,P,Q) 

such that r{w;c) > vw, with r{w;co) — vw, or equivalent ly such that pl — pu — w. 
Thus for c > Co, pl{c,v) = w, which is continuous (being constant) in c. For < c < 
Co, r{w; c) < vw and Pl(c, v) is the unique solution p for < p < 1 of r{p; c) = pv, with 
r{p; c) > pv if p < pl and r(p; c) < pi? if p > pl- For any d > c, we have 

vpL{d,v) = r{pL{d,v);d,v) > r{pL{d,v);c,v), 

axid hence pL{d,v) > pl(c, v), so is non-decreasing in c. 
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Suppose Cn i c < Co and Phicn, v) [ p' > Pl{c, v). Then 



r{p';c,v) < vp' < r{p';Cn,v) 



r{p';c,v), 



a contradiction. So pl is continuous in c from the right. Likewise if t c < cq and 
PL{cn,v) T < Pl{c,v), then 



again a contradiction. Thus is continuous in c. The proof for pu is symmetrical. □ 

1.7.10 Lemma. For any e > 0, P 7^ Q, < ^ < 1 < S < oo, and V = SPRT(A,S), 
there exist c > 0, v e (0, 1), and p e (0,£) such that ip is Bayes for the given P, Q, p, c 
and V, and another such p with 1 — e < p < 1. 

Proof. Fix V e{l- e/B, 1) and let 



Then j is a continuous function of c > 0, by Lemma 1.7.9, since < < 1 and < pu < 1. 
For c large enough, p^ = Pui ^ind then j{c) = 1. As n — > 00, the infimum of error 
probabilities 

inf{max(Q;(P, 0),q;(Q,0)) : G Cn} ^ 

where is the class of ordinary (non-sequential) tests ,0;^) , as can be seen 

from 1.5.3 with B and 1/A (in 1.5.3, not here) large, then 1.5.2. Thus as c — 0, 
supo<p<i r(p; c) — > 0, so pl{c,v) — > and pu{c,v) 1. So j{c) — > 0. Thus by the 
intermediate value theorem there is a c such that j{c) = A/B. 

For that c, let Pi := Pl{c,v), pu := pu{c,v) and p := plB / {1 - pl + plB), with 
q := 1-p. Then p/q =plP/(1-pl) =Pc/^/(l-Pc/), so by Theorem 1.7.7, SPRT(A, 5) 
is Bayes. We have < p < plB < wB < £ as desired. To find another p > 1 — e, choose 
instead w G (0, Ae), proceeding otherwise as above. Then g = {1 —pu)/{l — pu +PuA) < 
{l-pu)/A < v/A < e. □ 

Proof of Theorem 1.5.1. The optimality of SPRTs will be proved also as compared 
to sequential randomized tests of P vs. Q. Given i/j = SPRT(A, P), with < A < 1 < 
B < 00, and £ > 0, Lemma 1.7.10 gives a, v, w — 1 — v, c > 0, and a p with < p < e 
for which ip is Bayes. Let cj) be another (randomized) test with a{P,(j)) < a{P,'ip) and 
a(Q,(/>) <a(Q,V). 

Case I. Suppose Ep^^N < 00 and Eq^^N < 00. Then by the Bayes property of ip, 

< r{p,(l)) -r{p,i;) = pv{a{P,(j)) -a{P,ij)) + qw{a{Q,(i)) -a{Q,i;)) 

+pc{Ep^^N-Ep^^N) + qc{EQ,^N-EQ,^N) < ec{Ep^^N - Ep^^N) + c{Eq,^N - Eq^^N). 

Letting e J, 0, it follows that Eq^^N > Eq^^N. Likewise, taking p > 1 — e from Lemma 
1.7.10 so that q < e and letting s I gives Ep^^N > Ep^^N as desired. 



r{p';c,v) > vp' > r{p';Cn,v) 



r{p';c,v), 



j{c) := pl{c,v)[1-pu{c,v)]/\pu{c,v){1-pl{c,v))]. 
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Case II. Suppose Ep^^N = +00. (Note: this case has httle practical interest, as one would 
not want to use a test with infinite risk (expected cost of observations) under P, even if 
it had a smaller expectation for N under Q.) For a proof by contradiction, suppose that 
Eq^^N < Eq^^N. 

Case Ila: Suppose also a{P,ip) > and a{Q,'ip) > 0. Let C := a{Q,ip)/2, D := 
2/ck(P, tA), and T := SPRT(C, D). Then by Lemma 1.5.2, Ep^tN < 00 and Eq^tN < 00. 
By Lemma 1.5.3, a{P,T) < a{P,%l}) and a{Q,T) < a{Q,il)). Let T{t) be a randomized 
sequential test with Pr(T(t) = T) = t and Pr(T(t) = (f)) = 1 — t. For t small enough, 

a(P,T) <a(P,V'), a{Q,T{t))<a{Q,i;), a.nd EQ^Tit)N < Eq^^N. 

For each = 1, 2, . . . , let T{t, k) be the test with T{t, k)j = T{t)j for j <k and T(t, k)k = 1, 
so that T(t, fc) acts the same as T{t) through the first k—1 observations, then T{t, k) chooses 
Q. For T(t, k) we have N < k, so -E'p,T(t,fe)-^ < 0° fo^ any /c. We have 

a{Q,T{t,k)) < a{Q,Tit))<aiQ,iP) and £;Q,T(t,fc)A^ < EQ^T{t)N < Eq,^N, 

while for k large enough, a{P,T{t, k)) < a{P,il)). This contradicts Case I. 
Case lib: oi{P,ip) = 0. So also a{P,(l)) — 0. Note that a{Q,ip) < 1 by Lemma 1.5.3 
since ^ < 1. If P{ri > 1) > 0, then for some e > 0, P(ri > 1 + e) > e. For each n, 
P"(rj > (1 + ey for j = 1, . . . , n) > e^. For some n, (1 + e)^ > 5 so ck(P, V') > > 0, 
a contradiction. Thus P(ri > 1) = 0. Since Q ^ P, Q{ri < 00} = / ridP < 1. Let 
p := {n = 00}, so Q{F) > 0. 

Case //6-Z: q:((5,'0) = also. Then just as P(ri > 1) = we also have Q{ri < 1) = 0. So 
ri = 0, 1 or +00 almost everywhere for P + Q, and the values of r„ are also 0, 1 or +00 
almost surely for P"' or Q^. Both error probabilities are for both tests. So on the event 
that ri = r2 = ■ ■ ■ ~ rn — 1, almost surely neither test has made a decision yet. Thus 
which decides the first time 7^ 1, has Eq^^N < Eq^^pN as desired. 

Case IIb2: a{Q,ip) > 0. Let U{k) be the test which takes k observations and chooses Q if 
rfe = +00, otherwise P. For k large enough, a{Q,U{k)) = (1 — Q{F)Y < a{Q,ip), while 
a{P,U{k)) = 0. Let Ug do with probability s and U{k) with probability 1 — s, where 
< s < 1. Then a(P, Us) = 0, a{Q, Us) < a(Q, V'), and Eq^Us^ < 00. 

Let V{j) be the test which equals Us before time j, and after the jth observation, 
if no decision was made previously, V{j) chooses P. Then a{P, V{j)) — 0. For j large 
enough, a{Q,V{j)) < a{Q,ip). So by Case I 

Eq,^N < EQy^j)N < Eq^uN = sEQ,^N+{l-s)EQ^u(k)N < sEq,^N + {1 - s)k. 

Letting s j 1, we get Eq^^N < EQ^(j)N, finishing cases IIb2, lib and II. 

Case III, interchanging P and Q in Case II, is symmetrical. So Theorem 1.5.1 is proved. □ 

Risks can be defined without priors, as in Sec. 1.2, as follows. These will be denoted 
here by p(-, ■) to distinguish them from risks r{p, ■) depending on priors. Let 

p(//,0) := L^^a{iJi,4>) + cE^j,^^N 
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where (/ijiy) = {P,Q) or {Q,P) and cj) is any sequential test of P vs. Q. Recall that for 
two such tests (f) , r] , (p ^ r] means (f>) < p{p., ry) for fu. = P and fi = Q, and (f) ~< i] means 
that in addition, the inequality is strict for // = P or Q. A sequential test is inadmissible 
if there exists (p ~< r], otherwise admissible. 

1.7.11 Corollary. For given c, v, w, P and Q, if A and B are defined as in Theorem 1.7.7, 
for any p = 1 — q with < p < 1, then ip := SPRT(A, B) is admissible. 

Proof. We have for < p < 1 and any sequential test (p, r{p, (p) = pp{P, (p) + QpiQ, (p)- 
If (p -< ip then since < p < 1 it would follow that r{p, (p) < r(j), ip), contradicting Theorem 
1.7.7. □ 

The Corollary is really a special case of Theorem 1.2.6. 

PROBLEM 

1. Suppose that in a sequential test of P vs. Q, there is a loss of v — LpQ —$20 if P 
is true and Q is chosen, and a loss w = Lqp =$20 if Q is true and P is chosen. Let 
the cost per observation be c =$1. Let p be the prior probability of P. Suppose that 
the minimum risk over all tests taking at least one observation is r{p) = 1 + 5p(l — p) 
dollars. 

(a) Find pl, such that if p < pl but not if p > pl, we should choose Q without taking 
any observation. 

(b) Find pn, such that if p > pu but not if p < pr, we should choose P without taking 
any observation. 
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